The problems of punctuation ambiguity in fully automatic text-to-speech conversion

نویسنده

  • Mike McAllister
چکیده

Fully automatic text-to-speech systems must accept as input any texts in whatever form they might be stored on a computer. As such, the role of punctuation characters in marking sentences, phrases and other textual constructs has to be exploited to produce natural sounding synthetic speech. Some characters not in the alpha-numeric set can, however, act both as text and as punctuation in different situations. A pre-processing module has therefore been implemented which is sensitive to these different roles and attempts to use them in preparing texts for text-to-speech conversion.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Development and Evaluation of Automatic Punctuation for French and English Speech-to-Text

Automatic punctuation of speech is important to make speechto-text output more readable and to facilitate downstream language processing. This paper describes the development of an automatic punctuation system for French and English. The punctuation model uses both textual information and acoustic (prosodic) information and is based on adaptive boosting. The system is evaluated on a challenging...

متن کامل

A System Description of P^4: Possible Punctuation Points Parser

We present a Natural Language Understanding (NLU) implementation that automatically inserts punctuation marks into a sequence of words to create a group of one or more syntactically correct sentences. The software, Possible Punctuation Points Parser (P^4) provides the ability for the user to input a string of words to process, performs the punctuation possibilities, and then provides several vi...

متن کامل

Punctuation has a point, so use it!

It is all too common for systems processing natural language, whether for input (automatic speech recognition, text queries, dialogue etc.) or output (text-to-speech), to ignore or strip out punctuation. The effect of prosodic factors, such as intonation and pausing, on language processing remains controversial. While there is an obvious relationship between punctuation and prosody it cannot be...

متن کامل

Punctuation Prediction with Transition-based Parsing

Punctuations are not available in automatic speech recognition outputs, which could create barriers to many subsequent text processing tasks. This paper proposes a novel method to predict punctuation symbols for the stream of words in transcribed speech texts. Our method jointly performs parsing and punctuation prediction by integrating a rich set of syntactic features when processing words fro...

متن کامل

Automatic Recovery of Punctuation Marks and Capitalization Information for Iberian Languages

This paper shows experimental results concerning automatic enrichment of the speech recognition output with punctuation marks and capitalization information. The two tasks are treated as two classification problems, using a maximum entropy modeling approach. The approach is language independent as reinforced by experiments performed on Portuguese and Spanish Broadcast News corpora. The discrimi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1989